Dataset statistics
| Number of variables | 11 |
|---|---|
| Number of observations | 748 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 210.1 KiB |
| Average record size in memory | 287.6 B |
Variable types
| Categorical | 4 |
|---|---|
| Numeric | 7 |
Warnings
| Warning | Type |
|---|---|
| Volumen (miles de kg) is highly correlated with Valor (miles de €) | High correlation |
| Valor (miles de €) is highly correlated with Volumen (miles de kg) | High correlation |
| Consumo per capita is highly correlated with Gasto per capita | High correlation |
| Gasto per capita is highly correlated with Consumo per capita | High correlation |
| Fecha is highly correlated with Año | High correlation |
| Año is highly correlated with Fecha | High correlation |
| Fecha is uniformly distributed | Uniform |
| CCAA is uniformly distributed | Uniform |
| Producto is uniformly distributed | Uniform |
| Volumen (miles de kg) has unique values | Unique |
| Valor (miles de €) has unique values | Unique |
Reproduction
| Analysis started | 2021-04-14 06:38:32.406968 |
|---|---|
| Analysis finished | 2021-04-14 06:38:38.597896 |
| Duration | 6.19 seconds |
| Software version | pandas-profiling v2.11.0 |
| Download configuration | config.yaml |
Fecha
Categorical
| Distinct | 22 |
|---|---|
| Distinct (%) | 2.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 46.9 KiB |
| 2018-07 | 34 |
|---|---|
| 2019-05 | 34 |
| 2018-10 | 34 |
| 2018-09 | 34 |
| 2019-11 | 34 |
| Other values (17) | 578 |
Length
| Max length | 7 |
|---|---|
| Median length | 7 |
| Mean length | 7 |
| Min length | 7 |
Characters and Unicode
| Total characters | 5236 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
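As a rough illustration of the character properties summarised above, the sketch below uses Python's standard `unicodedata` module to tally the general category of every character in a few sample values. The sample list and the helper name `category_counts` are assumptions made for this example and are not taken from the report's own code.

```python
import unicodedata
from collections import Counter

def category_counts(values):
    """Count Unicode general categories (e.g. 'Nd', 'Pd') over all characters."""
    counts = Counter()
    for value in values:
        for ch in value:
            counts[unicodedata.category(ch)] += 1
    return counts

# Hypothetical sample shaped like the Fecha column (YYYY-MM strings).
sample = ["2018-03", "2018-07", "2019-11"]
counts = category_counts(sample)

# 'Nd' is Decimal Number and 'Pd' is Dash Punctuation in Unicode's naming.
print(counts["Nd"], counts["Pd"])
```

Each `YYYY-MM` string contributes six decimal digits and one dash, which is why only two distinct categories appear for this variable.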
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2018-03 |
|---|---|
| 2nd row | 2018-03 |
| 3rd row | 2018-03 |
| 4th row | 2018-03 |
| 5th row | 2018-03 |
| Value | Count | Frequency (%) |
| 2018-07 | 34 | 4.5% |
| 2019-05 | 34 | 4.5% |
| 2018-10 | 34 | 4.5% |
| 2018-09 | 34 | 4.5% |
| 2019-11 | 34 | 4.5% |
| 2019-06 | 34 | 4.5% |
| 2018-06 | 34 | 4.5% |
| 2018-08 | 34 | 4.5% |
| 2018-05 | 34 | 4.5% |
| 2020-06 | 34 | 4.5% |
| Other values (12) | 408 | 54.5% |
Histogram of lengths of the category
| Value | Count | Frequency (%) |
| 2018-07 | 34 | 4.5% |
| 2019-05 | 34 | 4.5% |
| 2018-10 | 34 | 4.5% |
| 2018-09 | 34 | 4.5% |
| 2019-11 | 34 | 4.5% |
| 2019-06 | 34 | 4.5% |
| 2018-06 | 34 | 4.5% |
| 2018-08 | 34 | 4.5% |
| 2018-05 | 34 | 4.5% |
| 2020-06 | 34 | 4.5% |
| Other values (12) | 408 | 54.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 1564 | 29.9% |
| 2 | 884 | 16.9% |
| 1 | 816 | 15.6% |
| - | 748 | 14.3% |
| 8 | 374 | 7.1% |
| 9 | 374 | 7.1% |
| 3 | 102 | 1.9% |
| 4 | 102 | 1.9% |
| 5 | 102 | 1.9% |
| 6 | 102 | 1.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 4488 | 85.7% |
| Dash Punctuation | 748 | 14.3% |
Most frequent character per category
| Value | Count | Frequency (%) |
| 0 | 1564 | 34.8% |
| 2 | 884 | 19.7% |
| 1 | 816 | 18.2% |
| 8 | 374 | 8.3% |
| 9 | 374 | 8.3% |
| 3 | 102 | 2.3% |
| 4 | 102 | 2.3% |
| 5 | 102 | 2.3% |
| 6 | 102 | 2.3% |
| 7 | 68 | 1.5% |
| Value | Count | Frequency (%) |
| - | 748 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 5236 | 100.0% |
Most frequent character per script
| Value | Count | Frequency (%) |
| 0 | 1564 | 29.9% |
| 2 | 884 | 16.9% |
| 1 | 816 | 15.6% |
| - | 748 | 14.3% |
| 8 | 374 | 7.1% |
| 9 | 374 | 7.1% |
| 3 | 102 | 1.9% |
| 4 | 102 | 1.9% |
| 5 | 102 | 1.9% |
| 6 | 102 | 1.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 5236 | 100.0% |
Most frequent character per block
| Value | Count | Frequency (%) |
| 0 | 1564 | 29.9% |
| 2 | 884 | 16.9% |
| 1 | 816 | 15.6% |
| - | 748 | 14.3% |
| 8 | 374 | 7.1% |
| 9 | 374 | 7.1% |
| 3 | 102 | 1.9% |
| 4 | 102 | 1.9% |
| 5 | 102 | 1.9% |
| 6 | 102 | 1.9% |
Año
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 6.0 KiB |
| 2019 | 306 |
|---|---|
| 2018 | 306 |
| 2020 | 136 |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 4 |
| Min length | 4 |
Characters and Unicode
| Total characters | 2992 |
|---|---|
| Distinct characters | 5 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2018 |
|---|---|
| 2nd row | 2018 |
| 3rd row | 2018 |
| 4th row | 2018 |
| 5th row | 2018 |
| Value | Count | Frequency (%) |
| 2019 | 306 | 40.9% |
| 2018 | 306 | 40.9% |
| 2020 | 136 | 18.2% |
Histogram of lengths of the category
| Value | Count | Frequency (%) |
| 2018 | 306 | 40.9% |
| 2019 | 306 | 40.9% |
| 2020 | 136 | 18.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 884 | 29.5% |
| 0 | 884 | 29.5% |
| 1 | 612 | 20.5% |
| 8 | 306 | 10.2% |
| 9 | 306 | 10.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 2992 | 100.0% |
Most frequent character per category
| Value | Count | Frequency (%) |
| 2 | 884 | 29.5% |
| 0 | 884 | 29.5% |
| 1 | 612 | 20.5% |
| 8 | 306 | 10.2% |
| 9 | 306 | 10.2% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2992 | 100.0% |
Most frequent character per script
| Value | Count | Frequency (%) |
| 2 | 884 | 29.5% |
| 0 | 884 | 29.5% |
| 1 | 612 | 20.5% |
| 8 | 306 | 10.2% |
| 9 | 306 | 10.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2992 | 100.0% |
Most frequent character per block
| Value | Count | Frequency (%) |
| 2 | 884 | 29.5% |
| 0 | 884 | 29.5% |
| 1 | 612 | 20.5% |
| 8 | 306 | 10.2% |
| 9 | 306 | 10.2% |
Mes
Real number (ℝ≥0)
| Distinct | 9 |
|---|---|
| Distinct (%) | 1.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6.545454545 |
|---|---|
| Minimum | 3 |
| Maximum | 11 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 6.0 KiB |
Quantile statistics
| Minimum | 3 |
|---|---|
| 5-th percentile | 3 |
| Q1 | 4 |
| median | 6 |
| Q3 | 9 |
| 95-th percentile | 11 |
| Maximum | 11 |
| Range | 8 |
| Interquartile range (IQR) | 5 |
Descriptive statistics
| Standard deviation | 2.573017893 |
|---|---|
| Coefficient of variation (CV) | 0.3930999559 |
| Kurtosis | -1.157823685 |
| Mean | 6.545454545 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 0.2690193841 |
| Sum | 4896 |
| Variance | 6.620421078 |
| Monotonicity | Not monotonic |
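The quantile and descriptive statistics for Mes can be cross-checked from its frequency table (months 3 to 6 occur 102 times each, months 7 to 11 occur 68 times each). The following is a minimal sketch using only the standard library. The skewness formula is an assumption: it is the bias-adjusted sample estimator, which appears to match the reported value, but the report itself does not state which estimator it uses.

```python
import math
import statistics

# Frequency table for Mes as shown in the report: value -> count.
mes_counts = {3: 102, 4: 102, 5: 102, 6: 102, 7: 68, 8: 68, 9: 68, 10: 68, 11: 68}

# Expand the table back into a sorted list of 748 observations.
mes = sorted(v for v, c in mes_counts.items() for _ in range(c))
n = len(mes)

mean = statistics.fmean(mes)
variance = statistics.variance(mes)   # sample variance (divides by n - 1)
std = math.sqrt(variance)
cv = std / mean                       # coefficient of variation
median = statistics.median(mes)
mad = statistics.median(abs(x - median) for x in mes)  # median absolute deviation
q1, _, q3 = statistics.quantiles(mes, n=4, method="inclusive")

# Bias-adjusted sample skewness (assumed estimator, see the note above).
m2 = sum((x - mean) ** 2 for x in mes) / n
m3 = sum((x - mean) ** 3 for x in mes) / n
skewness = math.sqrt(n * (n - 1)) / (n - 2) * m3 / m2 ** 1.5

print(n, sum(mes), round(mean, 4), round(std, 4), round(cv, 4))
print(median, mad, q1, q3, q3 - q1, round(skewness, 4))
```

The coefficient of variation is simply the standard deviation divided by the mean, and the interquartile range is Q3 minus Q1, so both follow directly from the other rows of the table.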
Histogram with fixed size bins (bins=9)
| Value | Count | Frequency (%) |
| 6 | 102 | 13.6% |
| 5 | 102 | 13.6% |
| 4 | 102 | 13.6% |
| 3 | 102 | 13.6% |
| 11 | 68 | 9.1% |
| 10 | 68 | 9.1% |
| 9 | 68 | 9.1% |
| 8 | 68 | 9.1% |
| 7 | 68 | 9.1% |
| Value | Count | Frequency (%) |
| 3 | 102 | 13.6% |
| 4 | 102 | 13.6% |
| 5 | 102 | 13.6% |
| 6 | 102 | 13.6% |
| 7 | 68 | 9.1% |
| 8 | 68 | 9.1% |
| 9 | 68 | 9.1% |
| 10 | 68 | 9.1% |
| 11 | 68 | 9.1% |
| Value | Count | Frequency (%) |
| 11 | 68 | 9.1% |
| 10 | 68 | 9.1% |
| 9 | 68 | 9.1% |
| 8 | 68 | 9.1% |
| 7 | 68 | 9.1% |
| 6 | 102 | 13.6% |
| 5 | 102 | 13.6% |
| 4 | 102 | 13.6% |
| 3 | 102 | 13.6% |
CCAA
Categorical
| Distinct | 17 |
|---|---|
| Distinct (%) | 2.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 58.1 KiB |
| Castilla-La Mancha | 44 |
|---|---|
| Aragón | 44 |
| Canarias | 44 |
| La Rioja | 44 |
| Castilla y León | 44 |
| Other values (12) | 528 |
Length
| Max length | 26 |
|---|---|
| Median length | 13 |
| Mean length | 13.88235294 |
| Min length | 6 |
Characters and Unicode
| Total characters | 10384 |
|---|---|
| Distinct characters | 41 |
| Distinct categories | 5 |
| Distinct scripts | 2 |
| Distinct blocks | 2 |
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Andalucía |
|---|---|
| 2nd row | Aragón |
| 3rd row | Principado de Asturias |
| 4th row | Illes Balears |
| 5th row | Canarias |
| Value | Count | Frequency (%) |
| Castilla-La Mancha | 44 | 5.9% |
| Aragón | 44 | 5.9% |
| Canarias | 44 | 5.9% |
| La Rioja | 44 | 5.9% |
| Castilla y León | 44 | 5.9% |
| Cantabria | 44 | 5.9% |
| Comunitat Valenciana | 44 | 5.9% |
| Galicia | 44 | 5.9% |
| Región de Murcia | 44 | 5.9% |
| Principado de Asturias | 44 | 5.9% |
| Other values (7) | 308 | 41.2% |
Histogram of lengths of the category
| Value | Count | Frequency (%) |
| de | 176 | 12.1% |
| comunidad | 88 | 6.1% |
| comunitat | 44 | 3.0% |
| murcia | 44 | 3.0% |
| vasco | 44 | 3.0% |
| galicia | 44 | 3.0% |
| castilla | 44 | 3.0% |
| cataluña\/catalunya | 44 | 3.0% |
| la | 44 | 3.0% |
| illes | 44 | 3.0% |
| Other values (19) | 836 | 57.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 2024 | 19.5% |
| i | 748 | 7.2% |
| (space) | 704 | 6.8% |
| n | 616 | 5.9% |
| d | 572 | 5.5% |
| l | 572 | 5.5% |
| r | 572 | 5.5% |
| e | 440 | 4.2% |
| u | 396 | 3.8% |
| s | 396 | 3.8% |
| Other values (31) | 3344 | 32.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 8228 | 79.2% |
| Uppercase Letter | 1320 | 12.7% |
| Space Separator | 704 | 6.8% |
| Other Punctuation | 88 | 0.8% |
| Dash Punctuation | 44 | 0.4% |
Most frequent character per category
| Value | Count | Frequency (%) |
| a | 2024 | 24.6% |
| i | 748 | 9.1% |
| n | 616 | 7.5% |
| d | 572 | 7.0% |
| l | 572 | 7.0% |
| r | 572 | 7.0% |
| e | 440 | 5.3% |
| u | 396 | 4.8% |
| s | 396 | 4.8% |
| t | 396 | 4.8% |
| Other values (14) | 1496 | 18.2% |
| Value | Count | Frequency (%) |
| C | 396 | 30.0% |
| A | 132 | 10.0% |
| L | 132 | 10.0% |
| M | 132 | 10.0% |
| P | 88 | 6.7% |
| R | 88 | 6.7% |
| V | 88 | 6.7% |
| I | 44 | 3.3% |
| B | 44 | 3.3% |
| E | 44 | 3.3% |
| Other values (3) | 132 | 10.0% |
| Value | Count | Frequency (%) |
| \ | 44 | 50.0% |
| / | 44 | 50.0% |
| Value | Count | Frequency (%) |
| (space) | 704 | 100.0% |
| Value | Count | Frequency (%) |
| - | 44 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 9548 | 91.9% |
| Common | 836 | 8.1% |
Most frequent character per script
| Value | Count | Frequency (%) |
| a | 2024 | 21.2% |
| i | 748 | 7.8% |
| n | 616 | 6.5% |
| d | 572 | 6.0% |
| l | 572 | 6.0% |
| r | 572 | 6.0% |
| e | 440 | 4.6% |
| u | 396 | 4.1% |
| s | 396 | 4.1% |
| t | 396 | 4.1% |
| Other values (27) | 2816 | 29.5% |
| Value | Count | Frequency (%) |
| (space) | 704 | 84.2% |
| - | 44 | 5.3% |
| \ | 44 | 5.3% |
| / | 44 | 5.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 10120 | 97.5% |
| None | 264 | 2.5% |
Most frequent character per block
| Value | Count | Frequency (%) |
| a | 2024 | 20.0% |
| i | 748 | 7.4% |
| (space) | 704 | 7.0% |
| n | 616 | 6.1% |
| d | 572 | 5.7% |
| l | 572 | 5.7% |
| r | 572 | 5.7% |
| e | 440 | 4.3% |
| u | 396 | 3.9% |
| s | 396 | 3.9% |
| Other values (28) | 3080 | 30.4% |
| Value | Count | Frequency (%) |
| ó | 132 | 50.0% |
| í | 88 | 33.3% |
| ñ | 44 | 16.7% |
Producto
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 54.9 KiB |
| T.HORTALIZAS FRESCAS | 374 |
|---|---|
| T.FRUTAS FRESCAS | 374 |
Length
| Max length | 20 |
|---|---|
| Median length | 18 |
| Mean length | 18 |
| Min length | 16 |
Characters and Unicode
| Total characters | 13464 |
|---|---|
| Distinct characters | 15 |
| Distinct categories | 3 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | T.HORTALIZAS FRESCAS |
|---|---|
| 2nd row | T.HORTALIZAS FRESCAS |
| 3rd row | T.HORTALIZAS FRESCAS |
| 4th row | T.HORTALIZAS FRESCAS |
| 5th row | T.HORTALIZAS FRESCAS |
| Value | Count | Frequency (%) |
| T.HORTALIZAS FRESCAS | 374 | 50.0% |
| T.FRUTAS FRESCAS | 374 | 50.0% |
Histogram of lengths of the category
| Value | Count | Frequency (%) |
| frescas | 748 | 50.0% |
| t.frutas | 374 | 25.0% |
| t.hortalizas | 374 | 25.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| S | 2244 | 16.7% |
| A | 1870 | 13.9% |
| T | 1496 | 11.1% |
| R | 1496 | 11.1% |
| F | 1122 | 8.3% |
| . | 748 | 5.6% |
| (space) | 748 | 5.6% |
| E | 748 | 5.6% |
| C | 748 | 5.6% |
| H | 374 | 2.8% |
| Other values (5) | 1870 | 13.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 11968 | 88.9% |
| Other Punctuation | 748 | 5.6% |
| Space Separator | 748 | 5.6% |
Most frequent character per category
| Value | Count | Frequency (%) |
| S | 2244 | 18.8% |
| A | 1870 | 15.6% |
| T | 1496 | 12.5% |
| R | 1496 | 12.5% |
| F | 1122 | 9.4% |
| E | 748 | 6.2% |
| C | 748 | 6.2% |
| H | 374 | 3.1% |
| O | 374 | 3.1% |
| L | 374 | 3.1% |
| Other values (3) | 1122 | 9.4% |
| Value | Count | Frequency (%) |
| . | 748 | 100.0% |
| Value | Count | Frequency (%) |
| (space) | 748 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 11968 | 88.9% |
| Common | 1496 | 11.1% |
Most frequent character per script
| Value | Count | Frequency (%) |
| S | 2244 | 18.8% |
| A | 1870 | 15.6% |
| T | 1496 | 12.5% |
| R | 1496 | 12.5% |
| F | 1122 | 9.4% |
| E | 748 | 6.2% |
| C | 748 | 6.2% |
| H | 374 | 3.1% |
| O | 374 | 3.1% |
| L | 374 | 3.1% |
| Other values (3) | 1122 | 9.4% |
| Value | Count | Frequency (%) |
| . | 748 | 50.0% |
| (space) | 748 | 50.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 13464 | 100.0% |
Most frequent character per block
| Value | Count | Frequency (%) |
| S | 2244 | 16.7% |
| A | 1870 | 13.9% |
| T | 1496 | 11.1% |
| R | 1496 | 11.1% |
| F | 1122 | 8.3% |
| . | 748 | 5.6% |
| (space) | 748 | 5.6% |
| E | 748 | 5.6% |
| C | 748 | 5.6% |
| H | 374 | 2.8% |
| Other values (5) | 1870 | 13.9% |
Volumen (miles de kg)
Real number (ℝ≥0)
| Distinct | 748 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 17775.62045 |
|---|---|
| Minimum | 965.63 |
| Maximum | 74537.73 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 6.0 KiB |
Quantile statistics
| Minimum | 965.63 |
|---|---|
| 5-th percentile | 2129.3885 |
| Q1 | 6456.08 |
| median | 10615.795 |
| Q3 | 24196.0375 |
| 95-th percentile | 56062.028 |
| Maximum | 74537.73 |
| Range | 73572.1 |
| Interquartile range (IQR) | 17739.9575 |
Descriptive statistics
| Standard deviation | 16676.60417 |
|---|---|
| Coefficient of variation (CV) | 0.9381728314 |
| Kurtosis | 1.235871105 |
| Mean | 17775.62045 |
| Median Absolute Deviation (MAD) | 6314.4 |
| Skewness | 1.438292125 |
| Sum | 13296164.1 |
| Variance | 278109126.7 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 27674.86 | 1 | 0.1% |
| 34050.87 | 1 | 0.1% |
| 10900.84 | 1 | 0.1% |
| 41835.07 | 1 | 0.1% |
| 24676.78 | 1 | 0.1% |
| 9433.73 | 1 | 0.1% |
| 56467.11 | 1 | 0.1% |
| 10759.11 | 1 | 0.1% |
| 10581.5 | 1 | 0.1% |
| 2280.09 | 1 | 0.1% |
| Other values (738) | 738 | 98.7% |
| Value | Count | Frequency (%) |
| 965.63 | 1 | 0.1% |
| 1112 | 1 | 0.1% |
| 1119.7 | 1 | 0.1% |
| 1162.95 | 1 | 0.1% |
| 1181.04 | 1 | 0.1% |
| 1228.54 | 1 | 0.1% |
| 1235.79 | 1 | 0.1% |
| 1333.46 | 1 | 0.1% |
| 1333.78 | 1 | 0.1% |
| 1383.15 | 1 | 0.1% |
| Value | Count | Frequency (%) |
| 74537.73 | 1 | 0.1% |
| 71883.85 | 1 | 0.1% |
| 71869.97 | 1 | 0.1% |
| 71261.38 | 1 | 0.1% |
| 69570.32 | 1 | 0.1% |
| 69299.62 | 1 | 0.1% |
| 69084 | 1 | 0.1% |
| 68945.41 | 1 | 0.1% |
| 68311.03 | 1 | 0.1% |
| 68180.69 | 1 | 0.1% |
Valor (miles de €)
Real number (ℝ≥0)
| Distinct | 748 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 30127.16766 |
|---|---|
| Minimum | 1752.05 |
| Maximum | 152351.12 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 6.0 KiB |
Quantile statistics
| Minimum | 1752.05 |
|---|---|
| 5-th percentile | 3876.7035 |
| Q1 | 11580.1675 |
| median | 18621.025 |
| Q3 | 42810.8075 |
| 95-th percentile | 89290.8045 |
| Maximum | 152351.12 |
| Range | 150599.07 |
| Interquartile range (IQR) | 31230.64 |
Descriptive statistics
| Standard deviation | 27942.9279 |
|---|---|
| Coefficient of variation (CV) | 0.9274993325 |
| Kurtosis | 1.505808811 |
| Mean | 30127.16766 |
| Median Absolute Deviation (MAD) | 10961.045 |
| Skewness | 1.463166068 |
| Sum | 22535121.41 |
| Variance | 780807219.4 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 6336.3 | 1 | 0.1% |
| 3919.47 | 1 | 0.1% |
| 23184.39 | 1 | 0.1% |
| 88184.01 | 1 | 0.1% |
| 30056.11 | 1 | 0.1% |
| 70794.27 | 1 | 0.1% |
| 80538.85 | 1 | 0.1% |
| 8694.1 | 1 | 0.1% |
| 5090.15 | 1 | 0.1% |
| 6706.17 | 1 | 0.1% |
| Other values (738) | 738 | 98.7% |
| Value | Count | Frequency (%) |
| 1752.05 | 1 | 0.1% |
| 2142.06 | 1 | 0.1% |
| 2257.54 | 1 | 0.1% |
| 2407.06 | 1 | 0.1% |
| 2454.1 | 1 | 0.1% |
| 2491.4 | 1 | 0.1% |
| 2581.08 | 1 | 0.1% |
| 2610.99 | 1 | 0.1% |
| 2720.22 | 1 | 0.1% |
| 2785.61 | 1 | 0.1% |
| Value | Count | Frequency (%) |
| 152351.12 | 1 | 0.1% |
| 136585.14 | 1 | 0.1% |
| 134417.36 | 1 | 0.1% |
| 125592.17 | 1 | 0.1% |
| 120584.52 | 1 | 0.1% |
| 119607.84 | 1 | 0.1% |
| 117787.65 | 1 | 0.1% |
| 116167.34 | 1 | 0.1% |
| 114096.11 | 1 | 0.1% |
| 113798.92 | 1 | 0.1% |
Precio medio kg
Real number (ℝ≥0)
| Distinct | 99 |
|---|---|
| Distinct (%) | 13.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.732954545 |
|---|---|
| Minimum | 1.14 |
| Maximum | 2.23 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 6.0 KiB |
Quantile statistics
| Minimum | 1.14 |
|---|---|
| 5-th percentile | 1.39 |
| Q1 | 1.59 |
| median | 1.74 |
| Q3 | 1.89 |
| 95-th percentile | 2.04 |
| Maximum | 2.23 |
| Range | 1.09 |
| Interquartile range (IQR) | 0.3 |
Descriptive statistics
| Standard deviation | 0.2030311483 |
|---|---|
| Coefficient of variation (CV) | 0.1171589577 |
| Kurtosis | -0.3733476205 |
| Mean | 1.732954545 |
| Median Absolute Deviation (MAD) | 0.15 |
| Skewness | -0.2139075562 |
| Sum | 1296.25 |
| Variance | 0.04122164719 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 1.72 | 18 | 2.4% |
| 1.9 | 18 | 2.4% |
| 1.87 | 17 | 2.3% |
| 1.86 | 17 | 2.3% |
| 1.81 | 17 | 2.3% |
| 1.54 | 17 | 2.3% |
| 1.61 | 16 | 2.1% |
| 1.68 | 16 | 2.1% |
| 1.67 | 16 | 2.1% |
| 1.74 | 15 | 2.0% |
| Other values (89) | 581 | 77.7% |
| Value | Count | Frequency (%) |
| 1.14 | 1 | 0.1% |
| 1.15 | 1 | 0.1% |
| 1.2 | 1 | 0.1% |
| 1.21 | 2 | 0.3% |
| 1.23 | 1 | 0.1% |
| 1.24 | 3 | 0.4% |
| 1.26 | 1 | 0.1% |
| 1.28 | 2 | 0.3% |
| 1.29 | 3 | 0.4% |
| 1.3 | 3 | 0.4% |
| Value | Count | Frequency (%) |
| 2.23 | 1 | 0.1% |
| 2.21 | 1 | 0.1% |
| 2.2 | 1 | 0.1% |
| 2.18 | 3 | 0.4% |
| 2.16 | 1 | 0.1% |
| 2.15 | 1 | 0.1% |
| 2.14 | 3 | 0.4% |
| 2.13 | 4 | 0.5% |
| 2.12 | 2 | 0.3% |
| 2.11 | 1 | 0.1% |
Penetración (%)
Real number (ℝ≥0)
| Distinct | 488 |
|---|---|
| Distinct (%) | 65.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 95.64737968 |
|---|---|
| Minimum | 83.7 |
| Maximum | 99.93 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 6.0 KiB |
Quantile statistics
| Minimum | 83.7 |
|---|---|
| 5-th percentile | 90.8835 |
| Q1 | 94.57 |
| median | 96.215 |
| Q3 | 97.365 |
| 95-th percentile | 98.6265 |
| Maximum | 99.93 |
| Range | 16.23 |
| Interquartile range (IQR) | 2.795 |
Descriptive statistics
| Standard deviation | 2.47322656 |
|---|---|
| Coefficient of variation (CV) | 0.02585775552 |
| Kurtosis | 2.476143849 |
| Mean | 95.64737968 |
| Median Absolute Deviation (MAD) | 1.335 |
| Skewness | -1.349754378 |
| Sum | 71544.24 |
| Variance | 6.116849617 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 96.4 | 6 | 0.8% |
| 97.89 | 5 | 0.7% |
| 97.4 | 5 | 0.7% |
| 97.44 | 4 | 0.5% |
| 95.96 | 4 | 0.5% |
| 96.09 | 4 | 0.5% |
| 96.53 | 4 | 0.5% |
| 95.81 | 4 | 0.5% |
| 98.02 | 4 | 0.5% |
| 96.73 | 4 | 0.5% |
| Other values (478) | 704 | 94.1% |
| Value | Count | Frequency (%) |
| 83.7 | 1 | 0.1% |
| 84.97 | 1 | 0.1% |
| 85.18 | 1 | 0.1% |
| 85.76 | 1 | 0.1% |
| 85.94 | 1 | 0.1% |
| 86.34 | 1 | 0.1% |
| 87.01 | 1 | 0.1% |
| 87.04 | 1 | 0.1% |
| 87.12 | 1 | 0.1% |
| 87.31 | 1 | 0.1% |
| Value | Count | Frequency (%) |
| 99.93 | 1 | 0.1% |
| 99.76 | 1 | 0.1% |
| 99.75 | 1 | 0.1% |
| 99.56 | 1 | 0.1% |
| 99.55 | 1 | 0.1% |
| 99.43 | 1 | 0.1% |
| 99.41 | 1 | 0.1% |
| 99.36 | 1 | 0.1% |
| 99.21 | 1 | 0.1% |
| 99.2 | 1 | 0.1% |
Consumo per capita
Real number (ℝ≥0)
| Distinct | 466 |
|---|---|
| Distinct (%) | 62.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6.669251337 |
|---|---|
| Minimum | 2.78 |
| Maximum | 14.29 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 6.0 KiB |
Quantile statistics
| Minimum | 2.78 |
|---|---|
| 5-th percentile | 3.81 |
| Q1 | 4.8875 |
| median | 6.475 |
| Q3 | 8.2025 |
| 95-th percentile | 10.2665 |
| Maximum | 14.29 |
| Range | 11.51 |
| Interquartile range (IQR) | 3.315 |
Descriptive statistics
| Standard deviation | 2.071850598 |
|---|---|
| Coefficient of variation (CV) | 0.3106571478 |
| Kurtosis | -0.600069684 |
| Mean | 6.669251337 |
| Median Absolute Deviation (MAD) | 1.65 |
| Skewness | 0.3977149357 |
| Sum | 4988.6 |
| Variance | 4.292564901 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 7.52 | 7 | 0.9% |
| 7.17 | 7 | 0.9% |
| 4.97 | 4 | 0.5% |
| 8.12 | 4 | 0.5% |
| 3.85 | 4 | 0.5% |
| 4.48 | 4 | 0.5% |
| 8.03 | 4 | 0.5% |
| 5.56 | 4 | 0.5% |
| 5.35 | 4 | 0.5% |
| 5.1 | 4 | 0.5% |
| Other values (456) | 702 | 93.9% |
| Value | Count | Frequency (%) |
| 2.78 | 1 | 0.1% |
| 3.05 | 1 | 0.1% |
| 3.16 | 1 | 0.1% |
| 3.25 | 1 | 0.1% |
| 3.35 | 2 | 0.3% |
| 3.37 | 1 | 0.1% |
| 3.38 | 1 | 0.1% |
| 3.41 | 2 | 0.3% |
| 3.43 | 1 | 0.1% |
| 3.46 | 2 | 0.3% |
| Value | Count | Frequency (%) |
| 14.29 | 1 | 0.1% |
| 12.86 | 1 | 0.1% |
| 12.31 | 1 | 0.1% |
| 11.89 | 1 | 0.1% |
| 11.7 | 1 | 0.1% |
| 11.56 | 1 | 0.1% |
| 11.51 | 1 | 0.1% |
| 11.38 | 1 | 0.1% |
| 11.26 | 1 | 0.1% |
| 11.2 | 1 | 0.1% |
Gasto per capita
Real number (ℝ≥0)
| Distinct | 557 |
|---|---|
| Distinct (%) | 74.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 11.37536096 |
|---|---|
| Minimum | 5.54 |
| Maximum | 27.63 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 6.0 KiB |
Quantile statistics
| Minimum | 5.54 |
|---|---|
| 5-th percentile | 6.9235 |
| Q1 | 8.935 |
| median | 10.935 |
| Q3 | 13.2525 |
| 95-th percentile | 17.2995 |
| Maximum | 27.63 |
| Range | 22.09 |
| Interquartile range (IQR) | 4.3175 |
Descriptive statistics
| Standard deviation | 3.324742988 |
|---|---|
| Coefficient of variation (CV) | 0.292275823 |
| Kurtosis | 1.154433315 |
| Mean | 11.37536096 |
| Median Absolute Deviation (MAD) | 2.16 |
| Skewness | 0.9066751329 |
| Sum | 8508.77 |
| Variance | 11.05391593 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 15.76 | 4 | 0.5% |
| 11.01 | 4 | 0.5% |
| 12.02 | 4 | 0.5% |
| 10.55 | 4 | 0.5% |
| 11.38 | 3 | 0.4% |
| 10.17 | 3 | 0.4% |
| 12.4 | 3 | 0.4% |
| 12.04 | 3 | 0.4% |
| 13.9 | 3 | 0.4% |
| 12.2 | 3 | 0.4% |
| Other values (547) | 714 | 95.5% |
| Value | Count | Frequency (%) |
| 5.54 | 1 | 0.1% |
| 5.61 | 1 | 0.1% |
| 5.66 | 1 | 0.1% |
| 5.69 | 1 | 0.1% |
| 5.72 | 1 | 0.1% |
| 5.76 | 1 | 0.1% |
| 5.9 | 1 | 0.1% |
| 6.21 | 1 | 0.1% |
| 6.26 | 1 | 0.1% |
| 6.36 | 1 | 0.1% |
| Value | Count | Frequency (%) |
| 27.63 | 1 | 0.1% |
| 24.24 | 1 | 0.1% |
| 23.07 | 1 | 0.1% |
| 22.22 | 1 | 0.1% |
| 22.05 | 1 | 0.1% |
| 21.45 | 1 | 0.1% |
| 21.39 | 1 | 0.1% |
| 21.26 | 1 | 0.1% |
| 20.98 | 1 | 0.1% |
| 20.75 | 1 | 0.1% |
Pearson's r
Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. Its value lies between -1 and +1, with -1 indicating total negative linear correlation, 0 indicating no linear correlation, and +1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r. To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
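The calculation described above can be sketched in plain Python. The function name `pearson_r` is an assumption for illustration; the data are the Volumen and Valor columns from the ten rows listed under "First rows" in this report, so the result describes only that small sample and not the full dataset.

```python
import math

def pearson_r(xs, ys):
    """Covariance of xs and ys divided by the product of their standard deviations."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    # The 1/(n - 1) factors cancel between numerator and denominator.
    return cov / math.sqrt(var_x * var_y)

# Volumen (miles de kg) and Valor (miles de €) from the ten "First rows".
volumen = [38505.79, 7578.78, 3701.71, 5728.03, 10900.84,
           1875.96, 9127.26, 8873.43, 44416.70, 4532.87]
valor = [66399.93, 13834.07, 7008.99, 10921.54, 19963.16,
         3166.49, 15770.06, 15699.99, 81759.89, 8213.68]

r = pearson_r(volumen, valor)
# Shifting and rescaling one variable leaves r unchanged.
r_shifted = pearson_r(volumen, [2.0 * v + 5.0 for v in valor])
print(round(r, 4), round(r_shifted, 4))
```

The second call illustrates the invariance under changes in location and scale: both printed values agree.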
Spearman's ρ
Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better at catching nonlinear monotonic correlations than Pearson's r. Its value lies between -1 and +1, with -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation, and +1 indicating total positive monotonic correlation. To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
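A minimal sketch of that procedure follows: rank both variables, then apply the Pearson formula to the ranks. The helper names `average_ranks`, `pearson_r` and `spearman_rho` are assumptions for this example, and ties are handled by average ranks, which is a common convention rather than something the report specifies.

```python
import math

def average_ranks(values):
    """Rank values from 1..n, giving tied values the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks

def pearson_r(xs, ys):
    """Covariance divided by the product of standard deviations."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = math.sqrt(sum((x - mx) ** 2 for x in xs))
    sy = math.sqrt(sum((y - my) ** 2 for y in ys))
    return cov / (sx * sy)

def spearman_rho(xs, ys):
    """Pearson correlation of the rank variables."""
    return pearson_r(average_ranks(xs), average_ranks(ys))

# A monotonic but nonlinear relationship: y = x cubed.
x = [1, 2, 3, 4, 5, 6]
y = [v ** 3 for v in x]
print(round(spearman_rho(x, y), 4), round(pearson_r(x, y), 4))
```

For this cubic relationship ρ reaches its maximum while r stays below it, which is the sense in which ρ is better at catching nonlinear monotonic association.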
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1, with -1 indicating total negative correlation, 0 indicating no correlation, and +1 indicating total positive correlation. To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
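The pair-counting definition above can be sketched directly. This is the simple form described in the text (often called tau-a), in which tied pairs count as neither concordant nor discordant. Statistical libraries often default to a tie-adjusted variant, so results may differ when ties are present. The function name `kendall_tau` and the toy data are assumptions for illustration.

```python
from itertools import combinations

def kendall_tau(xs, ys):
    """(concordant - discordant) / total number of pairs; ties count as neither."""
    concordant = discordant = 0
    pairs = list(combinations(range(len(xs)), 2))
    for i, j in pairs:
        sign = (xs[i] - xs[j]) * (ys[i] - ys[j])
        if sign > 0:
            concordant += 1
        elif sign < 0:
            discordant += 1
    return (concordant - discordant) / len(pairs)

# Toy data: two adjacent swaps in an otherwise increasing sequence.
x = [1, 2, 3, 4, 5]
y = [1, 3, 2, 5, 4]
tau = kendall_tau(x, y)
print(tau)  # 8 concordant and 2 discordant pairs out of 10
```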
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency, and reverts to the Pearson correlation coefficient in the case of a bivariate normal input distribution. Extensive documentation is available.
Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been shown to be biased, even for large samples. The report uses the bias-corrected measure proposed by Bergsma in 2013.
A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
First rows
| | Fecha | Año | Mes | CCAA | Producto | Volumen (miles de kg) | Valor (miles de €) | Precio medio kg | Penetración (%) | Consumo per capita | Gasto per capita |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 2018-03 | 2018 | 3 | Andalucía | T.HORTALIZAS FRESCAS | 38505.79 | 66399.93 | 1.72 | 96.93 | 4.43 | 7.64 |
| 1 | 2018-03 | 2018 | 3 | Aragón | T.HORTALIZAS FRESCAS | 7578.78 | 13834.07 | 1.83 | 98.11 | 5.77 | 10.54 |
| 2 | 2018-03 | 2018 | 3 | Principado de Asturias | T.HORTALIZAS FRESCAS | 3701.71 | 7008.99 | 1.89 | 93.71 | 3.41 | 6.46 |
| 3 | 2018-03 | 2018 | 3 | Illes Balears | T.HORTALIZAS FRESCAS | 5728.03 | 10921.54 | 1.91 | 95.81 | 5.56 | 10.60 |
| 4 | 2018-03 | 2018 | 3 | Canarias | T.HORTALIZAS FRESCAS | 10900.84 | 19963.16 | 1.83 | 97.72 | 4.99 | 9.13 |
| 5 | 2018-03 | 2018 | 3 | Cantabria | T.HORTALIZAS FRESCAS | 1875.96 | 3166.49 | 1.69 | 95.39 | 3.37 | 5.69 |
| 6 | 2018-03 | 2018 | 3 | Castilla-La Mancha | T.HORTALIZAS FRESCAS | 9127.26 | 15770.06 | 1.73 | 97.94 | 4.35 | 7.52 |
| 7 | 2018-03 | 2018 | 3 | Castilla y León | T.HORTALIZAS FRESCAS | 8873.43 | 15699.99 | 1.77 | 94.57 | 3.63 | 6.42 |
| 8 | 2018-03 | 2018 | 3 | Cataluña\/Catalunya | T.HORTALIZAS FRESCAS | 44416.70 | 81759.89 | 1.84 | 97.76 | 6.42 | 11.82 |
| 9 | 2018-03 | 2018 | 3 | Extremadura | T.HORTALIZAS FRESCAS | 4532.87 | 8213.68 | 1.81 | 98.83 | 3.84 | 6.97 |
Last rows
| | Fecha | Año | Mes | CCAA | Producto | Volumen (miles de kg) | Valor (miles de €) | Precio medio kg | Penetración (%) | Consumo per capita | Gasto per capita |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 738 | 2020-06 | 2020 | 6 | Castilla y León | T.FRUTAS FRESCAS | 25494.94 | 42843.78 | 1.68 | 95.47 | 11.12 | 18.69 |
| 739 | 2020-06 | 2020 | 6 | Cataluña\/Catalunya | T.FRUTAS FRESCAS | 68945.41 | 136585.14 | 1.98 | 96.86 | 9.98 | 19.77 |
| 740 | 2020-06 | 2020 | 6 | Extremadura | T.FRUTAS FRESCAS | 9673.89 | 16051.33 | 1.66 | 96.93 | 8.63 | 14.32 |
| 741 | 2020-06 | 2020 | 6 | Galicia | T.FRUTAS FRESCAS | 26235.77 | 50627.47 | 1.93 | 94.98 | 10.11 | 19.50 |
| 742 | 2020-06 | 2020 | 6 | La Rioja | T.FRUTAS FRESCAS | 2419.76 | 4879.42 | 2.02 | 99.03 | 7.88 | 15.90 |
| 743 | 2020-06 | 2020 | 6 | Comunidad de Madrid | T.FRUTAS FRESCAS | 62836.45 | 112799.10 | 1.80 | 97.16 | 10.32 | 18.53 |
| 744 | 2020-06 | 2020 | 6 | Región de Murcia | T.FRUTAS FRESCAS | 14095.66 | 22926.48 | 1.63 | 96.16 | 10.15 | 16.51 |
| 745 | 2020-06 | 2020 | 6 | Comunidad Foral de Navarra | T.FRUTAS FRESCAS | 6188.89 | 11851.16 | 1.91 | 96.03 | 10.78 | 20.63 |
| 746 | 2020-06 | 2020 | 6 | País Vasco | T.FRUTAS FRESCAS | 22219.73 | 43357.87 | 1.95 | 97.85 | 10.96 | 21.39 |
| 747 | 2020-06 | 2020 | 6 | Comunitat Valenciana | T.FRUTAS FRESCAS | 43709.98 | 78115.91 | 1.79 | 95.94 | 9.38 | 16.77 |